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Abstract — We address the problem of finding the minimum 
decomposition of a permutation in terms of transpositions with 
non-uniform cost. For arbitrary non-negative cost functions, we 
describe polynomial-time, constant-approximation decomposition 
algorithms. For metric-path costs, we describe exact polynomial- 
time decomposition algorithms. Our algorithms represent a 
combination of Viterbi-type algorithms and graph-search tech- 
niques for minimizing the cost of individual transpositions, and 
dynamic programing algorithms for finding minimum cost cycle 
decompositions. The presented algorithms have applications in 
information theory, bioinformatics, and algebra. 



I. Introduction 

Permutations are ubiquitous combinatorial objects encoun- 
tered in areas as diverse as mathematics, computer science, 
communication theory, and bioinformatics. The set of all 
permutations of n elements - the symmetric group of order 
n, §„ - plays an important role in algebra, representation 
theory, and analysis of algorithms im-H. As a consequence, 
the properties of permutations and the symmetric group have 
been studied extensively. 

One of the simplest ways to generate an arbitrary permuta- 
tion is to apply a sequence of transpositions - swaps of two 
elements - on a given permutation, usually the identity per- 
mutation. The sequence of swaps can be reversed in order to 
recover the identity permutation from the original permutation. 
This process is referred to as sorting by transpositions. 

A simple result, established by Cayley in the 1860's, asserts 
that the minimum number of transpositions needed to sort a 
permutation so as to obtain the identity permutation is the 
difference of the size of the permutation and the number of 
cycles formed by the elements of the permutation. Cayley's 
result is based on a simple constructive argument, which re- 
duces to a linear-complexity procedure for breaking cycles into 
sub-cycles. Sorting a permutation is equivalent to finding the 
transposition distance between the permutation and the identity 
permutation. Since permutations form a group, the transposi- 
tion distance between two arbitrary permutations equals the 
transposition distance between the identity permutation and 
the composition of the inverse of one permutation and the 
other permutation. 

We address the substantially more challenging question: 
assuming that each transposition has a non-negative, but other- 
wise arbitrary cost, is it possible to find the minimum sorting 
cost and the sequence of transpositions used for this sorting in 
polynomial time? In other words, can one compute the cost- 
constrained transposition distance between two permutations 
in polynomial time? Although at this point it is not known 
if the problem is NP hard, at first glance, it appears to be 



computationally difficult, due to the fact that it is related to 
finding minimum generators of groups and the subset-sum 
problem fSj. Nevertheless, we show that large families of cost 
functions - such as costs based on metric-paths - have exact 
polynomial-time decomposition algorithms. Furthermore, we 
devise algorithms for approximating the minimum sorting cost 
for any non-negative cost function, with an approximation 
constant that does not exceed four. 

Our investigation is motivated by three different applica- 
tions. 

The first application pertains to sorting of genomic se- 
quences, while the second application is related to a gener- 
alization of the notion of a chemical channel (also known as 
trapdoor channel [6J). The third application is in the area of 
coding for storage devices. 

Genomic sequences - such as DNA sequences - evolved 
from one common ancestor, and therefore frequently contain 
conserved subsequences. During evolution or during the onset 
of a genomic disease, these subsequences are subject to muta- 
tions, and they may exchange their locations. As an example, 
genomes of cancer cells tend to contain the same sequence of 
blocks as normal cells, but in a reshuffled (permuted) order 
This finding motivated a large body of work on developing 
efficient algorithms for reverse-engineering the sequence of 
shuffling steps performed on conserved subsequences. With 
a few exceptions, most of the methods for sorting use rever- 
sals rather than transpositions, they follow the uniform cost 
model (each change in the ordering of the blocks is equally 
likely) and the most parsimonious sorting scenario (the sorting 
scenario with smallest number of changes is the most likely 
explanation for the observed order). Several approaches that do 
not fit into this framework were described in |7|, [8|. Sorting 
by cost-constrained transpositions can be seen as a special 
instant of the general subsequence sorting problem, where the 
sequence is allowed to break at three or four points. Unfortu- 
nately, the case of two sequence breakpoints, corresponding to 
so called reversals, cannot be treated within this framework. 

The second application arises in the study of chemical 
channels. The chemical channel is a channel model in which 
symbols are used to describe molecules, and where the channel 
permutes the molecules in a queue using adjacent transposi- 
tiono In information theory, the standard chemical channel 
model assumes that there are only two molecules, and that the 
channel has only two states - hence the use of adjacent trans- 
positions. If all the molecules are different, and the channel is 

'Usually, the channel is initialized by a molecule that may appear in the 
queue as well. 



allowed to output molecules with time-varying probabilities, 
one arrives at a channel model for which the output is a cost- 
constrained permutation of the input. Finding the minimum 
cost sequence decomposition therefore represents an important 
step in the maximum likelihood decoding algorithm fort the 
channel. 

The third application is concerned with flash memories 
and rank permutation coding (see f9] and fTOl). In this 
case, one is also interested in sorting permutations using 
adjacent transpositions and computing the Kendall distance 
between permutations ifTTI . If one considers more precise 
charge leakage models for memory cells, the costs of ad- 
jacent transpositions become non-uniform. This can easily 
be captured by a transposition cost model in which non- 
adjacent transpositions have unbounded cost, while the costs of 
adjacent transpositions are unrestricted. Hence, the proposed 
decomposition algorithms can be used as part of general soft- 
information rank modulation decoders. 

Our findings are organized as follows. Section II introduces 
the notation followed in the remainder of the paper, as 
well as relevant definitions. Sections III and IV contain the 
main results of our study: a three-stage polynomial-time ap- 
proximation algorithm for general cost-constrained sorting of 
permutations, an exact polynomial-time algorithm for sorting 
with metric -path costs, as well as a complexity analysis of 
the described techniques. Section V contains the concluding 
remarks. 

II. Notation and Definitions 

A permutation tt of {1,2,--- ,n} is a bijection 
from {1,2,- ■ ■ ,n} to itself. The set of permutations of 
{1, 2, • • • , n} is denoted by S„, and is called the symmetric 
group on {1, 2, ■ • • , n}. A permutation can be represented in 
several ways. In the two-line notation, the domain is written 
on top, and its image below. The one-line representation is 
the second row of the two-line representation. A permutation 
may also be represented as the set of elements and their 
images. 

For example, one can write a permutation vr as vr (1) = 
3,7r(2) = l,7r(3) := 2,7r(4) = 5,7r(5) = 4, or more 
succinctly as tt = 31254, or in the two-line notation as 

12 3 4 5 
^31254 

Yet another way of writing a permutation is 
via a set of mappings, for example tt — 
{1 -5-3,2 -> 1,3^2,4^5,5^4}. It wifl be helpful 
to think of a permutation as a mapping from positions to 
objects. For example, tt (1) ~ 3 means object 3 occupies 
position 1. Alternatively, we can also say that element 1 is a 
predecessor of element 3. If not otherwise stated, the word 
predecessor will be henceforth used in this context. 

The product 7r27ri of two permutations tti and tt2 is the 
permutation obtained by first applying vri and then tt2 to 
{1, 2, • • • , n}, i.e., the product represents the composition of 
TTi and 7r2. 

The functional digraph of a function / : {1,2,--- ,n} — > 
{1,2,- •• ,n}, denoted by G (f), is a directed graph with 



vertex set {1, 2, • • • ,n} and an edge from i to f (i) for each 
i e {l,2,---,n}. We use the words vertex and element 
interchangeably. For a permutation vr of {1, 2, • • • , n}, Q (vr) 
is a collection of disjoint cycles, since the in-degree and out- 
degree of each vertex is exactly one. Each cycle can be written 
as a fc— tuple a = (0102 ■ • • a^), where k is the length of 
the cycle and a^+i — a (ui). For each cycle of length k, the 
indices are evaluated modulo k, so that 0^+1 equals ai. A 
planar embedding of Q (vr) can be obtained by placing vertices 
of each of the disjoint cycles on disjoint circles. We hence 
reserve the symbol a for single cycles, and vr for multiple 
cycle permutations. 

We use Q (vr) to refer to the planar embedding of the 
functional digraph of vr on circles, as well as the functional 
digraph of vr. As a convention, we do not explicitly indicate 
the direction of edges on the circle. Instead, we assume a 
clockwise direction and treat Q (vr) as a non-directional graph, 
unless otherwise stated. 

A cycle of length two is called a transposition. A trans- 
position decomposition r (or simply a decomposition) of a 
permutation vr is a sequence tm- ■ ■ ti of transpositions ti 
whose product is w. Note that the transpositions are applied 
from right to left. A sorting s of a permutation tt is a sequence 
of transpositions that transform vr into t, where i denotes the 
identity element of §„. In other words, svr = t. Note that a 
decomposition r in reverse order equals a sorting s of the 
same permutation. 

The cycle representation of a permutation is the list of 
its cycles. For example, the cycle representation of 31254 
is (132) (45). Cycles of length one are usually omitted. 
The product of non-disjoint cycles is interpreted as a prod- 
uct of permutations. As an illustration, (124) (213) = 
((124) (3)) ((213) (4)) = (2) (134). 

A permutation vr is said to be odd (even) if the number of 
pairs a, 6 e {1, 2, • • • ,n} such that a < b and tt (a) > tt (6) is 
odd (even). If a permutation is odd (even), then the number of 
transpositions in any of its decompositions is also odd (even). 

The following definitions regarding graphs G = (V, E) will 
be used throughout the paper. An edge with endpoints u and 
V is denoted by {uv) E E. A graph is said to be planar if it 
can be embedded in the plane without intersecting edges. The 
subgraph of G induced by the vertices in the set 5 C V is 
denoted by G [S]. The degree of a vertex z; in G is denoted by 
degQ (v) or, if there is no ambiguity, by deg {v). Deletion of 
an edge e from a graph G is denoted by G — e and deletion of 
a vertex v and its adjacent edges from G is denoted hy G — v. 
The same notions can be defined for multigraphs - graphs in 
which there may exist multiple edges between two vertices. 

We say that an edge e in G is a cut edge for two vertices 
a and b, denoted by a,6-cutedge, if in G — e there exists no 
path between a and b. The well known Menger's theorem lfT2l 
asserts that the minimum number of edges one needs to delete 
from G to disconnect a from b is also the maximum number 
of pairwise edge-disjoint paths between a and 6. This theorem 
holds for multigraphs as well. 

Let T (t) be a (multi)graph with vertex set {1, 2, • • • , n} 
and edges (aibi) for each transposition ti = [aibi) of r. We 
use the words transposition and edge interchangeably. The 



embedding of T [t] with vertex set {1, 2, • • • , n} into Q (tt) 
is also denoted by T{t). In the derivations to follow, we 
make frequent use of the spanning trees of the (multi)graphs 
T (t) ,Q (tt) and Q (tt) U T (t). A spanning tree is a standard 
notion in graph theory: it is a tree that contains all vertices of 
the underlying (multi)graph. 

We are concerned with the following problem: given a non- 
negative cost function ip on the set of transpositions, the cost of 
a transposition decomposition is defined as the sum of costs of 
its transpositions. The task is to find an efficient algorithm for 
generating the Minimum Cost Transposition Decomposition 
(MCD) of a permutation tt G S„ . The cost of the MCD of 
a permutation n under cost function ip is denoted by M^ (tt). 
If there is no ambiguity, the subscript is omitted. 

For a non-negative cost function (p, let K. {(p) be the undi- 
rected complete graph in which the cost of each edge (ab) 
equals ip (a, b). The cost of a graph G C K. (ip) is the sum of 
the costs of its edges. 



cost (G) 



E 

{ab)eG 



ip{a,b). 



The shortest path, i.e., the path with minimum cost, between 
i and j in K, (ip) is denoted by p* {i,j). 

The following definitions pertaining to cost functions are 
useful in our analysis. A cost function (p is a metric if for 
a,b,c G {1,2, • • • ,n} 

'-P {o-, c) < p (a, b) + p {b, c) . 

A cost function ip is a metric-path cost if it is defined in terms 
of a weighted path, denoted by 9s. The weights of edges (uv) 
in Qs are equal to p {u, v), and the cost of any transposition 
(ij) equals 



^ihj) 



t=i 



p{ct,ct+i), 



where ci • • -qq+i, ci — i, q+i = j, represents the unique 
path between i and j in Qg. The path Qg is called the defining 
path of p. A cost function p is an extended-metric-path cost 
function if for a defining path 8s, p («, j) is finite only for the 
edges [ij) of the defining path, and unbounded otherwise. 

Applying a transposition [ab] to a permutation vr is equiv- 
alent to exchanging the predecessors of a and b in ^(vr). 
We define a generalization of the notion of a transposition, 
termed h-transposition, where the predecessor of a can be 
changed independently of the predecessor of b. For example, 
let a, b,c,d ^ {1, 2, • • • , n} and let tt (c) = a and tt (d) — b. 
Let vr' — (c, (a — J> 6)) tt, where we used (c, (a —> fo)) to denote 
an h-transposition. This h-transposition takes c, the predeces- 
sor of a, to b, without modifying the predecessor of b. That is, 
we have a mapping in which tt' (c) — tt' (d) = b, and a has no 
predecessor. Note that tt' is no longer a bijection, and several 
elements may be mapped to one element. A transposition 
represents the product of a pair of h-transpositions, as in 

(ab) TT = {tt-^ (a) , (a -^ b)) {tt'^ (b) , {b -^ a)) tt. 

An h-decomposition h of a permutation tt is a sequence of 
h-transpositions such that hi = tt. Similar to transpositions, 
a cost ifj (a, &) > can be assigned to h-transpositions 



(c, (a —;> 6)), where c is the predecessor of a. Note that the 
cost Ip is not dependent on c. We say that the transposition 
cost p and the h-transposition cost ip are consistent if for all 
transpositions {ab) it holds that p (a, b) = ijj (a, b) + -0 [b, a). 
For a permutation tt and a transposition {ab), it can be easily 
verified that {ab) tt consist of one more (or one less) cycle than 
TT if and only if a and b are in the same cycle (in different cy- 
cles). Since the identity permutation has n cycles, a Minimum 
Length Transposition Decomposition (MLD) of tt has length 
n—i, where I denotes the number of cycles in tt. The minimum 
cost of an MLD of tt, with respect to cost function p, is 
denoted by L^ {tt). For example, (132) (45) = (45) (23) (12) 
is decomposed into three transpositions. In particular, if tt is 
a single cycle, then the MLD of the cycle has length n ~ 1. 
A cycle of length k has k^^'^ MLDs in). An MCD is not 
necessarily an MLD, as illustrated by the following example. 

Example 1. Consider the cycle a = (1 • • ■ 5) with 

p}{i,i -\- 1) — 3 and p{i,i + 2) ~ 1. It is easy to verify that 
(14) (13)(35)(24)(14)(13) is an MCD of a with cost six, i.e., 
M (cr) = 6. However, as we shall see later, the cost of a 
minimum cost MLD is eight, i.e., L {cr) = 8. One such MLD 

is (14) (23) (13) (45) |14J. D 

Our approach to finding the minimum cost decomposition 
of a permutation consists of three stages: 

1) First, we find the minimum cost decomposition for each 
individual transposition. In particular, we show that the 
minimum cost decomposition of a transposition can be 
obtained by recursively substituting transpositions with 
triples of transpositions. This step is superfluous for the 
case when the cost function is a metric. 

2) In the second step, we consider cycles only and assume 
that each transposition cost is optimized. Cycles have the 
simplest structure among all permutations, and further- 
more, each permutation is a collection of cycles. Hence, 
several approximation algorithms operate on individual 
cycles and combine their decompositions. As part of this 
line of results, we describe how to find the minimum cost 
MLD and show that its cost is not more than a constant 
factor higher than that of the corresponding MCD. We 
also present a particularly simple-to-implement class of 
decompositions whose costs lie between the cost of a 
minimum MLD and a constant multiple of the cost of 
an MCD. 

3) We generalize the results obtained for single cycles to 
permutations with multiple cycles. 

III. Optimizing Individual Transposition 

Let r be a transposition decomposition and let {ab) be a 
transposition in r. Since a transposition is an odd permutation, 
it may only be written as the composition of an odd number 
of transpositions. For example. 



{ab) = {ac) {be) {ac) , 



(1) 



where c 6 {1, 2, • • • ,n} and c j^ a,b. It is straightforward to 
see that any decomposition of a transposition of length three 
must be of the form ([TJ, with a possible reversal of the roles 
of the elements a and b. 



If If {ah) > 2ip {ac) + (p {be) , then replacing {ab) by 
{ac) {be) {ac) reduces the overall cost of {ab). Thus, the first 
step of our decomposition algorithm is to find the optimal cost 
of each transposition. As will be shown, it is straightforward to 
develop an algorithm for finding minimum cost decompositions 
of transpositions of the form ([T]i. One such algorithm - Alg.[T] 
- performs a simple search on the ordered set of transpositions 
in order to check if their product, of the form of ([T]), yields 
a decomposition of lower cost for some transposition. It then 
updates the costs of transpositions and performs a new search 
for decompositions of length three that may reduce some 
transposition cost. 

The optimized costs produced by the algorithm are denoted 
by If*. Note that ip*{a,h) < 2ip*{a,x) + ip*{b,x), for any 
X ^ a,b. Although an optimal decomposition of the form 
produced by Alg. [TJs not guaranteed to produce the overall 
minimum cost decomposition of any transposition, we show 
that this is indeed the case after the expositions associated 
with Alg. \T\ 

Observe that if the cost function is such that 

(p{b,c) + 2(p{a,c)>(p{a,b), a,b,c e {1,2,- ■ ■ ,n}, (2) 

as in Example 1, Alg.[T]is redundant and can be omitted when 
computing the MCD. In particular, if the cost function is a 
metric, then Alg. [T]is not needed. 

The input to the algorithm Alg. [T] is an ordered list il of 
transpositions and their costs. Each row of fl corresponds to 
one transposition and is of the form [{ab) \(p {a, b)]. Sorting of 
fi means reordering its rows so that transpositions are sorted 
in increasing order of their costs. The output of the algorithm 
is a list with the same format, but with minimized costs for 
each transposition. 



Al 


gorithm 1 Optimize-Transposition-Costs(O) 


1 


Input: n (the list of transpositions and their cost) 


2 


Sort il 


3 


for ii- 2: \n\ do 


4 


{aibi)^n{i) 


5 


(j>i <- (p{ai,bi) 


6 


for j — 1 : i — 1 do 


7 


(0262) ^ n{j) 


8 


(f)2 <- ^{a2M) 


9 


if {ai6i} 0(0262} 7^0 then 


10 


aconi ^ {ai,6i}n {02,62} 


11 


{03,63} <- {01,02,61,62} - {acorn} 


12 


if (f)! + 2(/)2 < v?(a3, 63) then 


13 


update 93(03,63) in fl 


14 


Sort Vt 



Lemma 2. Alg.\l\optimizes the costs ip of all transpositions 
with respect to the triple transposition decomposition. 

Proof Let Vii be the list Q, at the beginning of iteration 
i, obtained immediately before executing line 4 of Alg. [T] 
We prove, by induction, that transpositions in Vti{l : i) have 
minimum triple decomposition costs that do not change in 



subsequent iterations of the algorithm, and that the transpo- 
sitions in rii(i + 1 : \n\) cannot be written as a product of 
transpositions exclusively in ni{l : i — 1) that have smaller 
overall cost. 

The claim is obviously true for i ~ 2. 

Assume that the claim holds for i. Let ti — ili{i) and 
consider s G fli{i + 1 : \^\). By the induction assumption, s 
cannot be written as a product of transpositions exclusively in 
ili{l : i — I) having smaller overall cost. Thus, the cost of s 
may be reduced only if one can write s as ^2^1^21 where ^2 £ 
il{l : i — 1). The list fli+i is obtained after considering all such 
transpositions, updating ip and sorting fli. The transposition 
of minimum cost in rii+i(i + 1 : |0|) is ili+i(i + 1). Now 
ili+i{i + 1 : \rt\) cannot be written in terms of transpositions 
in r2i+i(l : i) only, and hence the cost of any transposition 
in 57i+i(i + 1 : \ft\) cannot be reduced below the cost of 
^i+i(* + !)■ Hence, the cost of rii+i(i + 1) is minimized. ■ 

Example 3. The left-most list in ^ represents the input il 
to the algorithm, with transpositions in increasing order of 
their costs. The two lists that follow represent updates of H. 
produced by Alg. [T] In the first step, the algorithm considers 
the transposition (13), for i = 2, and the transposition 
(34), for j — 1. Using these transpositions we may write 
(34) (13) (34) = (14). The initial cost of (14) is 12 which 
exceeds 2p {3, A) + p {1, 3) — 8. Hence, the list representing 
il is updated to form the second list in (O. Next, for i — 3 
and j = 1, the algorithm considers (24) and (34). Since 
(34) (24) (34) = (23), we update the cost of (23) from 23 
to 11 as shown in the third list in ^. Additional iterations of 
the algorithm introduce no further changes in the costs. 



r (34) 


2 




r (34) 


2 




" (34) 


2 


(13) 


4 




(13) 


4 




(13) 


4 


(24) 


7 




(24) 


7 




(24) 


7 


(14) 


12 




iU) 


8 




(14) 


8 


(12) 


15 




(12) 


15 




{23) 


11 


L (23) 


23 




[ (23) 


23 




_ (12) 


15 



(3) 



D 
Upon executing the algorithm, the cost of each transposition 
is set to its minimal value. Only after the last stage of the MCD 
approximation algorithm is completed will each transposition 
be replaced by its minimal cost decomposition. For each index 
i the number of operations performed in the algorithm is 
0(|r2|). Thus, the total complexity of the algorithm is 0(|ri| ). 
Since \n\ is at most equal to the number of transpositions, 
we have \il\ — (2). Hence, the complexity of Alg. [T] equals 
0(n4). 

In the analysis that follows, denote the optimized transpo- 
sition costs by the superscript *, as in (p*. 

Since the transposition costs are arbitrary non-negative 
values, it is not clear that the minimum cost decomposition 
of a transposition is necessarily of the form generated by Alg. 
[1] This algorithm only guarantees that one can identify the op- 
timal sequence of consecutive replacements of transpositions 
by triples of transpositions. Hence, the minimum cost of a 
transposition (06) may be smaller than (p*{a, 6), i.e. there may 



be decompositions of length five, seven, or longer, which allow 
for an even smaller decomposition cost of a transposition. 

Fortunately, this is not the case: we first prove this claim 
for decompositions of length five via exhaustive enumeration 
and then proceed to prove the general case via the use of 
Mengers's theorem for multigraphs [IJ. We choose to provide 
the example of length-five decompositions since it illustrates 
the difficulty of proving statements about non-minimal de- 
compositions of permutations using exhaustive enumeration 
techniques. Graphical representations, on the other hand, allow 
for much more general and simpler proofs pertaining to non- 
minimal decompositions of transpositions. 

We start by considering all possible transposition decom- 
positions of length five, for which the transposition costs are 
first optimized via Alg. [T| In other words, we investigate if 
there exist decompositions of (ab) of length five that have 
cost smaller than (p*{a, b). Once again, observe that the costs 
of all transpositions used in such decompositions are first 
optimized via a sequence of triple-transposition decomposi- 
tions. To reduce the number of cases, we present the following 
lemma restricting the possible configurations in a multigraph 
corresponding to the decomposition of a transposition (ab). 

Lemma 4. Let t be a decomposition of a transposition (ab). 
The multigraph A^ = T (t), where t does not contain {ab), 
has the following properties: 

1) Both a and b have degree at least one. 

2) The degree of at least one of the vertices a and b is at 
least two. 

3) Every vertex of A^ — T {t), other than a and b, appears 
in a closed path (cycle) with no repeated edges in 7W. 

Proof The proof follows from the simple observations 
that : 

1) In order to swap a and b, both a and b must be moved. 

2) If both vertices a and b have degree one, then a and b 
are moved exactly once. This is only possible only if 
{ab) e T. 

3) Let r = tm---t2ti. Let ti be the transposition with 
the smallest index i that includes x £ V{M). In the 
permutation ti — ti---t2ti, x is not in its original 
location but rather occupies the position of another 
element, say, y. As we shall see in the proof of Lemma 
(|9]l and Example (fTOl i. this means that there is a path 
from X to y in T(ti). Similarly, there must exist a path 
from yto X inT {tm ■ ■ ■ ii+2^i+i)- Thus there is a closed 
path with no repeated edges from x to itself in A4. 

■ 
Let xi,X2,- ' ' I ^N be vertices included in the decomposi- 
tion T other than a and 6. If E{A4) denotes the number of 
edges in the multigraph Ai — T{t), then 

N 

2 1^ {M)\ = J2 deg (^0 + deg (a) + deg (6) . 

i=l 

From parts [Hand 12] of Lemma |4] note that deg (a) + deg (6) > 
3 and, from part [3] it holds that deg(xi) > 2. Hence, 



2 \E {M)\ >2N + 3, and since N has to be an integer, 

N<[\E{M)\-^\ = \E{M)\-2. (4) 

Suppose that r — tr^t4t^t2ti is the minimum cost decompo- 
sition of {ab) with cost (f), and that the cost of the optimal 
decomposition produced by Alg. [T] exceeds 4>. Then there is 
no vertex x such that 

Gi = {{ax) , {ax) , {bx)} 

is a subset of edges in the multigraph M since, in that case, 

f* (a, b) < 2ip* (a, x) + ip* (6, x) < </>. 

Also, there exists no pair of vertices x, y such that 

G2 = {{ax) , {bx) , {ay) , {by)} 

is a subset of edges in the multigraph Ai. To prove this claim, 
suppose that G2 C E{Ai). Without loss of generality, assume 
that 



Then, 



ip* {a, x) + (fi* {b, x) < (f* {a, y) + (p* {b, y) . 

(f* {a, b) < 2ip* {a, x) + ip* (6, x) 
<2ip*{a,x) + 2ip*{b,x) 

<C0St(G'2) 

<6. 



Hence, any decomposition of length five that contains G2 
must have cost at least ip* {a, b). 

For any five-decomposition r, we have \E {A4)\ — 5 and, 
thus, A^ < 3. We consider all five-decompositions of {ab) such 
that A4 is Gi— free and G2— free, and which contain at most 
five vertices in M.. Assume that the three extra vertices, in 
addition to a and b, are c, d, and e. We now show that for 
each decomposition of length five, there exists a decomposition 
obtained via Alg. [T]with cost at most (p, denoted by either fi 
or /i'. The following scenarios are possible. 

1) Suppose that deg (a) = 2 and deg {b) — 1. Furthermore, 
suppose that there exist a vertex that is adjacent to both 
a and b in A4. Without loss of generality, assume that 
{ad) e M, {bd) e M (Figure [Tall. We consider two 
cases, depending on the existence of the edge {cd) in 
M. 

First, assume that {cd) G M (Figure [Tb]l. If ip* {a, c) + 
'P* (c, d) < (p* {a, d), then the decomposition 



M 



{ac) {cd) {bd) {cd) {ac) 



has cost at most 0. Note that fi can be obtained from 
Alg. [Tj since 

/i = {ac) {be) {ac) = {ab) . 

On the other hand, if ip* {a, c) + p* (c, d) > p* (a, d), 
then the decomposition /i' = {ad) {bd) {ad) has cost at 
most ip. 

Next assume that {cd) ^ A4. Since both c and d each 
must lie on a cycle, the only possible decompositions 




(a) 




(b) 




(c) 




(d) 




(e) 




(f) 




(g) 




(h) 
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a) 



Figure 1: Possible Gi— and 6*2 —free configurations for A4 ~ T (t) when t is a five-decomposition of (ab). Note that other 
configurations can be obtained from these by relabehng a and b, and relabehng c, d, and e since they are symmetric. 



of {ab) are shown in Figure \Tc\ Now, if ip* (ad) < 
if* {d, e) + (fi* (e, c) + (f* (a, c), then the decomposition 

/i = (ad) (bd) (ad) 

has cost at most (p. On the other hand, if ip* (ad) > 
ip* {d, e) + (fi* (e, c) + ip* (a, c), then the decomposition 



/^' = (ac) (ec) (ed) (M) (ed) (ce) (ac) 



(5) 



has cost at most 
Alg. [T] since 



Note that /i' can be obtained from 



/i' = (ac) (ec) {be) (ec) (ac) 
= {ac) {cb) {ac) 
= {ab) . 

2) Suppose that deg (a) — 2 and deg {b) — 1, but that there 
is no vertex adjacent to both a and b. Without loss of 
generality, assume c and d are adjacent to a and e is 
adjacent to b (Figure [Tdl l. Since c, d, and e each must 
lie on a cycle, one must include two more edges in the 
graph, as shown in Figure (Te] Since d and c have a 
symmetric role in the decomposition, we may without 
loss of generality, assume that p* (a, c) + tp* (c, e) < 
(p* (a, d) + (p* (d, e). Let /i be equal to 

/I = {ac) {ce) {eb) {ce) {ac) . 

Similarly to (|5]), it is easy to see that the cost of fi is at 
most (f) and that it can be obtained from Alg. ([T]i. 

3) Assume that deg (a) = deg (6) = 2 (Figure [TJ. Since 
e and c must lie on a cycle, the fifth transposition 
in the decomposition must be (ec) (Figure [Tg| . If 
<y9* (a, d) + ip* {b, d) < ip* {b, e) + ip* (e, c) + (/?* (c, a), 
then the decomposition 

/i = {ad) {bd) {ad) 

has cost at most <j). Otherwise, if ip* {a, d) + (p* {b, d) > 
ip* {b, e) + (p* (e, c) + ip* (c, a), the decomposition 

/i' = {ac) {ec) {be) {ce) {ac) 



has cost at most cp. Note that both /i and //' represent 
decompositions of a form optimized over by Alg. [T] 

4) Suppose that deg (a) = 3, deg (6) = 1, and that all edges 
adjacent to a and b are simple (not repeated). Without 
loss of generality, assume that e is adjacent to both a 
and b (Figure fThl). One edge must complete cycles that 
include c, d, and e. Since creating such cycles with one 
edge is impossible, this configuration is impossible. 

5) Suppose that deg (a) = 3, deg (6) = 1, one edge 
adjacent to a appears twice, and there is a vertex adjacent 
to both a and b. Without loss of generality, assume that 
this vertex is d (Figure [Til). Since d must be in a cycle, 
it must be adjacent to the "last edge", i.e., the fifth 
transposition. If the last edge is (ed), then one more 
edge is needed to create a cycle passing through e. Thus, 
the last edge cannot be (ed). The only other choice is 
{cd) (Figure [lj]i. Now, if ip* (a, d) > ip* (c, d), then the 
decomposition 

/i = {ac) {cd) {bd) {cd) {ac) 

has cost at most (p. Otherwise, if (p* (a, d) < ip* (c, d), 
the decomposition 

m' = {ad) {bd) {ad) 

has cost at most (j). 

6) Suppose that deg (a) = 3, deg (6) = 1, and no vertex 
is adjacent to both a and b. The two possible cases 
are shown in Figures [Tkl and [TTl Since one edge cannot 
create all the necessary cycles, both configurations are 
impossible. 

Next, we state a general theorem pertaining to the optimality 

of Alg. n 

Theorem 5. The minimum cost decompositions of all trans- 
positions are generated by Alg. [7] 

Proof: The proof proceeds in two steps. First, we show 
that the multigraph AA for a transposition {ab) cannot have 
more than one a,6-cutedge. If Al has no a,6-cutedge, then 
there exist at least two edge-disjoint paths between a and b in 





(a) The edge (xiyi) is the only cut-edge 



(b) Edges (x'lj/i) and (x2y2) are the cut-edges 

Figure 2 



M. This claim follows by invoking Menger's theorem. The 
costs of the paths can be combined, leading to a cost of the 
form induced by a transposition decomposition optimized via 
([TJ. If the multigraph has exactly one a,6-cutedge, this case 
can be reduced to the case of no a,6-cutedge. This completes 
the proof. 

Before proving the impossibility of the existence of more 
than one a,6-cutedge, we explain how a a,&-cutedge imposes 
a certain structure in the decomposition of {ah). Consider the 
decomposition tmtm-i ■ ■ -ti ■ ■ -ti of (ab) and suppose that 
ti = (xij/i) is an a,6-cutedge, as shown in Figure l2al Let 
TTj — tj---ti. Since there exists a path between a to b, 
there also exists a path between a and xi that does not use 
the edge (a^ij/i). Thus, in 7W — (xij/i), a and xi are in the 
same "component". Denote this component by Bi. Similarly, 
a component, denoted by B2, must contain both the vertices b 
and 2/1. Since there is no transposition in tt^-i with endpoints 
in both Bi and B2, there is no element z E Bi such that 
7r,;-i (z) G i?2. Similarly, there is no element z G B2 such 
that TTi-i (z) G Bi. This implies that tt^-i (a) G Bi and 
TTi-i (b) G B2- Since (xij/i) is the only edge connecting Bi 
and i?2, we must have 

7r,^\ (a;i) = a, 
TT^-i (yi) = b, 
and 

71-.,"^ (a;i) = b, 

TT"^ (yi) = a- 

Now suppose there are at least two a,6-cutedges in T 
as shown in Figure |2b] Let the decomposition of (ab) be 
tm ■ ■ ■ ti ■ ■ ■ ti ■ ■ ■ ti, where U = {x2y2) and U ^ (xiyi), for 
some i < I. Define Bi, B2, and B^ to be the components 
containing a, yi, and j/2, respectively, in TW — (xij/i) — {x2y2)- 
By the same reasoning as above we must have 

7ril\ (2:2) = a, 
7r,l\ (2/2) == &. 

However, this cannot be true: after applying [xiyi), the 
successor of b belongs to Bi, and there are no other edges 
connecting the two components of the multigraph. Hence, the 
successor of b before transposing (a;2j/2) (that is, the successor 
of b in TTi-i) cannot be 1/2 ■ 

Since M cannot contain more than one a,&-cutedge, it 
must contain either one a,6-cutedge or it must contain no 
a,6-cutedges. 



Consider next the case when there is no a,6-cutedge in AA. 
In this case, based on Menger's theorem, there must exist at 
least two pairwise edge disjoint paths between a and 6. The 
cost of one of these paths has to be less than or equal to the 
cost of the other path. Refer to this path as the minimum path. 
Clearly, the cost of the decomposition of [ab) described by 
A^ is greater than or equal to twice the cost of the minimum 
path. 

Let the edges of the minimum path be 
{azi){ziZ2)...{zm-xZm){zmb), for some integer m. The 
cost of {ab) is greater than or equal to 

2tfi*{a,zi) + 2ip'{zi,Z2) H h 2ip* [zm-iz^] + 2(/p*(zm,b) > 

ifi*{a,zi) + 2ip*{zi,Z2) + ... + 2ip* {zm-iZm) + 2ip*{zm,h) > 

(p*{a,Z2) + 2(p*{z2,z:i) + ... + 2ip*{zm,b) > 

■ ■ ■ > if*{a,z,n) + 2(/p*(z„,6) > <^*(a,b), 

(6) 

and the cost of the decomposition associated with Ai cannot 
be smaller than the cost of the optimal decomposition pro- 
duced by Alg. [T] 

Next, consider the case when there is one a,&-cutedge in 
M. In this case, we distinguish two scenarios: when xi = a, 
and when xi ^ a. 

In the former case, the transposition (ayi) plays the role 
of the transposition (azi) and the remaining transpositions 
used in the decomposition lie in the graph M — (ayi). Since 
■M ~ (ayi) has no a,6-cutedge, continuing with line two of 
(6) proves that the cost of the decomposition associated with 
Ai cannot be smaller than (p*{a, b). 

In the later case, the procedure we described for the case 
xi = a is first applied to the multigraph containing the edge 
(xij/i) and the sub-multigraph containing the vertex a. As a 
result, the edge (xiyi) is replaced by (ayi), with cost greater 
than or equal to <f*{a, yi). Applying the same procedure again, 
now for the case xi — a, proves the claimed result. ■ 

As an illustration, one can see in Figures la- 11 that the 
multigraphs corresponding to decompositions of length five 
have no more than one a,&-cutedge. 

A quick inspection of Alg. 1 reveals that it has the structure 
of a Viterbi-type search for finding a minimum cost path in 
a transposition graph. An equivalent search procedure can be 
devised to operate on the graph /C (ip), rather than on a trellis. 
The underlying search algorithm is described in the Appendix, 
and is based on a modification of the well known Bellman- 
Ford procedure ifTSl . 

Definition 6. For an arbitrary path p = C1C2 ■ ■ ■ Cm in IC {(p). 



the transposition path cost is defined as 

m 

(p{p) = 2y^ (p{ci,c^+l) -max(p{ci,Ci+i). 

^ — ^ i 

i=l 

Let p (a, b) be a path with minimum transposition path cost 
among paths between a and b. That is, 

ip{p (a, b)) — inin (p (p) , 
p 

where the minimum is taken over all paths pin K, {ip) between 
a and b. Furthermore, let p* {a, b) be the standard shortest path 
between a and b in the cost graph /C {(p). 

Lemma 7. The minimum cost of a transpositions (ab) is at 
most Cp [p (a, b)). That is, 

LP* {a,b) < p{p{a,b)). 

Proof: Suppose that p = cqCi ■ ■ ■ CmCm+i where a = cq 
and b = Cm+i- Note that, for any < i < ?7i. 



(ab) = (cqCi • • ■ Ci_iCj) {Cm+lCm ' ' • Ci+2Ci+l) 

(CtCi+i) {Ci+lCi+2 ■ ■ ■ CmCm+l) (CiCi-i ■ ■ ■ CiCq) , (7) 

Choose i = argmaxj p {cj, c^+i) so that (ciCi+i) is the most 
costly edge in p (a, b). 

Each of the cycles in (Q can be decomposed using the edges 
of p as 

(coCi • • • C^-iCi) = {Ci-iCi) ■ ■ ■ (c2Ci) (cico) , 

{CiCi-1 ■ • • CiCo) = (cqCi) (C1C2) ■ • ■ (Cj_iCj) , 

(Ci+lCi+2 ■ • • CmCm+l) = (CmCm+l) {Cm-lCm) ' • ' (ci+2Ci+l) , 

{Cm+lCm ■ ■ ■ Ci+2Ci+l) = (ci+lCi+2) ' • ' {Cm-lCm) {CmCm+l) ■ 

(8) 
Thus, the minimum cost of (ab) does not exceed 

m 
3=0 



Lemma 8. The minimum cost of a transposition (ab) equals 
the minimum transposition path cost p{p (a, 6)). That is, 

p* {a,b) = ip{p{a,b)) . 

Proof Suppose t is the minimum cost decomposition of 
(ab). Let M = T{t), and note that cost (M) = ip* (a, b). 

In the proof of Theorem |5l we showed that Ai has at 
most one a,6-cutedge. Suppose that A4 has no a,6-cutedge. 
Then there are two edge-disjoint paths between a and b in 
A4. Define the minimum path as in Theorem |5] Suppose the 
minimum path is p — cqCi ■ ■ ■ CmCm+i- It is easy to see that 

■^ {p (a, b)) <<p{p) < cost (M) = p* (a, b) . 

From Lemma (|7]i, we have p}* (a, b) < p>{p (a, 6)). Hence, in 
this case, we conclude that p)* (a, b) = (p {p (a, b)). 

Next, suppose that Ai has one a,6-cutedge, as shown in 
Figure |2a] Menger's theorem implies that there are two edge- 
disjoint paths between a and xi and two edge-disjoint paths 



between b and j/i. Let pi be the path with smaller cost among 
the pair of paths between a and xi, and similarly, let p2 be 
the path with smaller cost between the pair of paths between 
b and j/i. Let p be the path obtained by concatenating pi, 
the edge (xiyi), and p2- Note that (p{p{a,b)) < p>{p) < 
cost [M) < p* (a, b). Since p* (a, b) < ip{p (a, &)), we have 
p* {a,b) = p>{p{a,b)). ■ 

It is easy to see that p* (a, 6) < 2 cost {p* (a, b)) since we 
have 

p* {a,b) = ip{p{a,b)) 
< p {p* (a, b)) 
<2cost{p*{a,b)). (9) 

Note that the Belbnan-Ford Alg. |4l presented in the Ap- 
pendix, finds the paths p in IC{p) between a given vertex s 
and all other vertices in the graph. 

Lemma [8] provides an easy method for computing p* {i,j) 
when there is only one path with finite cost between i and j 
in K.{p). For example, for an extended-metric cost function 
p, we have 

I 



V 



{hi) =2^.^(ct,ct+i) - max .^(cf,ct+i), (10) 



t=i 



where p{i,j) — c\-- -Ci+i, c\ — i, q+i = j, is the unique 
path between i and j in Qg- 

IV. Optimizing Individual Cycles 



We consider next the cost optimization problem over single 
cycles. First, we find the minimum cost MLD via a dynamic 
programming algorithm. The minimum cost MLD is obtained 
with respect to the optimized cost function p* of the previous 
section. For simplicity, we henceforth omit the superscript in 
the cost whenever there is no ambiguity in terms which cost 
function is used. 

We also present a second algorithm to find decompositions 
whose cost, along with the cost of the minimum cost MLD, 
is not more than a constant factor higher than the cost of the 
MCD. Both algorithms are presented for completeness. 

The results in this section apply to any cycle a. However, 
for clarity of presentation, and without loss of generality, we 
consider the cycle a — {12 ■ ■ ■ k). 

A. Minimum Cost, Minimum Length Transposition Decompo- 
sition 

Recall that the vertices of Q {a) are placed on a circle. 
For an MLD t of a permutation tt with i cycles, T (t) is 
a forest with £ components; each tree in the forest is the 
decomposition of one cycle of tt. This can be easily seen by 
observing that each cycle corresponds to a tree. The following 
lemma provides a rigorous proof for this statement. 

Lemma 9. The graph 7" (t) of an MLD t of a cycle a is a 
tree. 

Proof: First, we show that 7~(t) is connected. The 
decomposition r transform the identity permutation i to ct by 






4 • 






■ • ■ 
3 

(a) ■Ka = i = 1234 




(b) TTf, = 2134 



(C) TTc = 3142 



(d) TTrf = 3241 



> 



3 

(e) TTe = 4231 



Figure 3: In any decomposition of the transposition (14), there exists a path from 1 to 4. In this illustration, the iteratively 
added edges of the path are dashed. 



transposing pairs of elements. Note that every transposition 
exchanges the predecessors of two elements. In i, each element 
i is a fixed point (i.e., it is its own predecessor) and in a, i is 
the predecessor of <t (i). Thus there exists a path, formed by 
a sequence of transpositions, between i and <t (i). An instance 
of such a path is described in Example [TO] 

To complete the proof, observe that T (r) has k vertices and 
k — 1 edges, since an MLD of a cycle of length k contains 
k — 1 transpositions. Hence, T (r) is a tree. ■ 

As already pointed out, we provide an example that il- 
lustrates the existence of a path from z to cr (i) in the 
decomposition t of a, for the special case when cr is a cycle 
of length two. 

Example 10. Consider the cycle a — (14). It is easy to 
see that t ^ (34) (12) (23) (34) (12) is a decomposition of 
cr. Figure [3] illustrates a path from vertex 1 to vertex 4 in 
T{t). For instance, the transposition (12) in t corresponds 
to the edge (12) in T{t), as shown in Figure [3bl and the 
transposition (23) corresponds to the edge (23) etc. The cycle 
cr is a cycle in tt^ in Figure [3il The path from 1 to 4 in T (t) 
is 1^-2^3^ 4. D 

For related ideas regarding permutation decompositions and 
graphical structures, the interested reader is referred to llT6ll . 

The following definitions will be used in the proof of a 
lemma which states that Q (tt) U T (t) is planar, provided that 
T is an MLD of a. 

Let R be the region enclosed by edges of Q (cr). Let T be 
a tree with vertex set {1, 2, • • • , k}, such that Q (a) U T is 
planar. Since T is a tree with edges contained in R, the edges 
of T partition R into smaller regions; each of these parts is 
the enclosure of a subset of edges of Q (cr) U T and includes 
the vertices of these edges. These vertices can be divided into 
corner vertices, lying at the intersection of at least two regions, 
and inner vertices, belonging only to one region. In Figure l4al 
5 (cr) with vertices V = {1,2,3,4,5,6} is partitioned into 
four regions, i?i, i?2, ^3 and R^. In R2, vertices 1 and 3 are 
corner vertices, while vertex 2 is an inner vertex. 

Lemma 11. For an MLD t ^ti---tk-i of a, T (r) U t/ (a) 
is planar That is, for ti = (0102), where ai < 02, and tj — 
(6162 )> where bi < b2, if ai < foi < 02, then oi < &2 < 02- 
-I, 




(a) Ticr 



,■'■ ^ bi 
R* 



a l ^^2 
03 

(b) na 




Proof: Note that r cr = i. Let r^ 



ii-1 



•ti. Since 



Figure 4: A region is divided into two regions by transposition 
(ah). See proof of Lemma [TT] 



T is an MLD of a, TiO has i cycles. The proof proceeds by 
showing that for all 1 < i < fc, the following two claims are 
true: 

(I) Q (cr) U T {Ti) is planar 

(II) Each cycle of Ti corresponds to a subregion Rof Q (cr)U 
T [Ti). The cycle corresponding to R contains all of its inner 
vertices and some of its corner vertices but no other vertex. 

Both claims (I) and (II) are obvious for i = 1. We show 
that if (I) and (II) are true for Ti, then they are also true for 

Ti+l- 

Let ti — [ah). Clearly, Ti+icr = tiTia has one more cycle 
than Tia, and by assumption, Q {a) U T {Ti) is planar and 
partitioned into a set of subregions. Note that a and b are in 
the same cycle, and thus are inner or corner veritices of some 
subregion R* of ^ (cr) U T {n). The edge {ah) divides R* 
into two subregions, Ra and Ri, (without crossing any edge 
in Q {a) U T {Ti)). This proves (I). 

Let the cycle corresponding to R* be 

^ = {aai ■ ■ -aibbi ■ ■ -hv) 
as seen in Figure |4b] Then, 

{ah) n = {aai ■ ■ ■ ai) (55i • • • fer) • 

Now the cycles (aai ■ ■ ■ ai) and (661 • • ■ 6/') in Ti+i correspond 
to subregions Ra and Rb, respectively, as see in Figure |4c] 
This proves claim (II) since the cycle corresponding to each 
subregion contains all of its inner vertices and some of its 
corner vertices but no other vertex. ■ 

The following lemma establishes a partial converse to the 
previous lemma. 
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Figure 5: Example illustrating the proof of Lemma [121 t 



Lemma 12. For a cycle a and a spanning tree T over the 
vertices {1, 2, • • • , fc}, for which Q (a) U T is planar, there 
exists at least one MLD t of a such that T — T (t). 

Proof: We prove the lemma by recursively constructing 
an MLD corresponding to T. If fc = 2, then T has exactly 
one edge and the MLD is the transposition corresponding to 
that edge. For fc > 2, some vertex has degree larger than one. 
Without loss of generality, assume that deg (1) > I. Let 

r = max {u\ (lu) G T} . 

Since T is a tree, T — (Ir) has two components. These two 
components have vertex sets {1, • • ■ , s} and {s + 1, • • • , k}, 
for some s. It is easy to see that 



Let 



(1 ... A:) = (s + 1 ... fcl)(l .•• s). (11) 

T' = T[{1,...,4], 

T" = T[{s + l,---,k,l}]. 

Note that T' and T" have fewer than k vertices. Furthermore, 
T'ug ((1 • . • s)) and T'UG {{s + l---kl)) are planar Thus, 
from the induction hypothesis, (1 . . • s) and (s + 1 . . • fc 1) 
have decompositions r' and r" of length s — 1 and k — s, 
respectively. By ( fTTT l. t"t' is an MLD for a. ■ 

Example 13. In Figure |5] we have r — 10 and s = 8. The 
cycle (1 . . ■ 12) can be decomposed into two cycles, 

(1 ... 12) = (9 10 11 121) (12 ... 8). 

Now, each of these cycles is decomposed into smaller cycles, 
for example 

(91011121) = (9 10) (10 11 121), 

(1...8) = (81)(l..-7). 

The same type of decomposition can be performed on the 

cycles (910),- ■. ,(17). D 

Since any MLD of a cycle can be represented by a tree that 
is planar on the circle, the search for an MLD of minimum cost 



only needs to be performed over the set of planar trees. This 
search can be performed using a dynamic program, outlined in 
Alg.|2l Lemma [T4l establishes that Alg.|2]produces a minimum 
cost MLD. 



Algorithm 2 Min-Cost-MLD 


1: Input: Optimized transposition cost function $* where 


$* . = ip*{i^j) (Output of Alg.[B 


2 


C{i,j) <— oo for i,j e [fc] 


3 


C{i,i) ^0 fori e [fc] 


4 


C{i,i + l) ^ LP* {i, i + 1) for i e [fc] 


5 


for ? = 2 fc - 1 do 


6 


for i = 1 ■ . • fc — / do 


7 


j ^i + l 


8 


tor i < s < r < j do 


9 


A 4- C{i, s) + C(s + 1, r) + Cir, j) + ip* (i, r) 


10: if A<C{i,j) then 


11: C{i,j)^A 



Lemma 14. The output cost ofAlg. (|2|i, C (1, fc), equals L [a). 

Proof: The algorithm finds the minimum cost MLD of 
(1 • . • fc) by first finding the minimum cost of MLDs of shorter 
cycles of the form (« • • • j), where 1 < « < j < fc. We look at 
the computations performed in the algorithm from a top-down 
point of view. 

Let Ct ihj) be the cost of the decomposition of the cycle 
fji'.j — {i...j'^^ using edges of T[{i,--- ,j}], where T is 
an arbitrary planar spanning tree over the vertices {1, • • • , fc} 
arranged on a circle. For a fixed T, let r and s be defined as 
in the proof of Lemma [12] We may write 

(z...j) = (s + l...r)(ir)(r.-.j)(i-.-s) (12) 

where i < s < r < j. Thus 

Ct («, j) ^Ct{s + 1, r) + ip* (z, r) + Ct {r, j) + Ct {i, s) . 

(13) 

Define C{i,j) — Ct* (hj), where 

T* = argininCT(«,j) 

denotes a tree that minimizes the cost of the decomposition 
of {i ■ ■ ■ j). Then, we have 

C{i,j) = C{s* + l,r*) + ^*{t,r*) + C{r*,j)+C{t,s*), 

(14) 
where s* and r* are the values that minimize the right-hand- 
side of dTSl l under the constraint l<i<s<r<j. Since the 
cost of each cycle can be computed from the cost of shorter 
cycles, C {i,j) can be obtained recursively, with initialization 



Cii,i + l)^ip*{i,i + l) 



(15) 



The algorithm searches over s and r and computes C(l,fc) 
using ([T4l i and dTSl ). 

Although these formulas are written in a recursive form, 
Alg. [2] is written as a dynamic program. The algorithm first 
computes C {i,j) for small values of i and j, and then finds 
the cost of longer cycles. That is, for each 2 < Z < fc — 1 
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in increasing order, C {i,i + l) is computed by choosing its 
optimal decomposition in terms of costs of smaller cycles. ■ 

Example 15. As an example, let us find the minimum cost 
decomposition of the cycle a = (1234) using the above 
algorithm. Let $ be the matrix of transposition costs, with 



$ 



$* 







5 10 


3 


2 


3 


- 


9 


- - 





5 9 


3 


-02 


3 


- - 


7 


— — — 






(16) 



After optimizing the transposition costs in <1> via Alg. [T] we 
obtain $*, shown beneath $. From Alg. |2l we obtain 

C (1, 3) = C (2, 3) + ip*{l,2) = 7, (s, r) - (1, 2) , 
C(2,4) = C(2,3) + ^*(2,4)=5,(s,r) = (3,4). 

Consider the cycle (1234), where i = 1 and j — 4. The 
algorithm compares (2) — 6 ways to represent the cost of this 
cycle using the cost of shorter cycles. The minimum cost is 
obtained by choosing s — 2 and r = 4, so that 

C(1,4) = C(2,4) + ^*(1,4) = 8. 

Writing C as a matrix, where C {i,j) — Cij, we have: 



C = 







D 



Note that we can modify the above algorithm to also find 
the underlying MLD by using (fT2l i to write the decomposition 
of every cycle with respect to r and s that minimize the 
cost of the cycle. For example, from ( fT2] ). by substituting the 
appropriate values of r and s, we obtain 

(1234) = (234) (14) 

= (34) (24) (14) . 

The initiahzation steps are performed in O (fc) time. The 
algorithm performs a constant number of steps for each i, j, r, 
and s such that l<i<s<r<j<fc. Hence, the 
computational cost of the algorithm is O (/c'*). 

Note that Alg. |2] operates on the optimized cost function 
(p*, obtained as the output of Alg. [T] Figure |6] illustrates 
the importance of first reducing individual transposition costs 
using Alg. [T] before applying the dynamic program. Since 
the dynamic program can only use fc — 1 transpositions of 
minimum cost, it cannot optimize the individual costs of 
transpositions and strongly relies on the reduction of Alg. [T] 
for producing low cost solutions. In Figure|6l the transposition 
costs were chosen independently from a uniform distribution 
over [0, 1]. 
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Alg 2 only 

Alg 1 and Alg 2 



length of cycle 



Figure 6: The average minimum MLD cost vs the length of 
the cycle. Transposition costs are chosen independently and 
uniformly in [0,1]. 



B. Constant-factor approximation for cost of MCD 

For the cycle a — (12 • • • fc) and 1 < j < fc, consider the 
decomposition 

{] + 1 J + 2) (j + 2 j + 3) ■ • ■ 

(fc - 1 fc) (fcl) (12) (23) ■ • • (j - 1 j) • 

The cost of this decomposition equals 

^V'*(z,a(z))-^*(j,a(j)). 

To minimize the cost of the decomposition, we choose j 
such that the transpositions (j j + 1) has maximum cost. This 
choice leads to the decomposition 

{f + ir + 2) {f + 2 J* + 3) • ■ • 

(fc - 1 fc) (fcl) (12) (23) • • • (j* - 1 f) (17) 



where 



j* = argmax(/?* (j, cr(j)) 



The decomposition in (fTTI) is termed the Simple Transposition 
Decomposition (STD) of a. The cost of the STD of a, denoted 
by S {(j), equals 

Theorem 16. For a cycle a, M (a) < L{a) < S (a) < 
AM (a) . 

Proof Clearly, M {a) < L (a) . It is easy to see that the 
STD is itself an MLD and, thus, L (a) < 5(cr). For S (a). 
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we have 



5(a)=^^*(^,a(^))-^*0•^a(r)) 
]ip* {i,a{i)) 
<2^cost(p*(z,a(i))) (18) 



< 






where the last inequality follows from (|9]l. To complete the 
proof, we need to show that M (cr) > i J^i cost {p* {i, a («)))• 
Since this result is of independent importance in our subse- 
quent derivations, we state it and prove it in Lemma [19] ■ 

In order to prove Lemma [19] we first prove Lemma [17] and 
a corollary. 

Consider a transposition cost function ip and a h- 
transposition cost function t/j. Recall from Section HIl that 

(cr-^ (a) , (a ^ b)) {a~'^ (b) ,{b^a))a^ (ab) a. 

Lemma 17. The minimum cost of an h-decomposition of a is 
upper-bounded by the cost of the MCD of a, provided that Lp 
and "0 are consistent. 

Proof We prove the lemma by showing that there exists 
an h-decomposition of <t with cost M {a). Suppose that the 

MCD of (7 is T = tmtm-l ' ' ' ^1, whcrC ti — {ttibi), 1 < J < 

m, and where m is the length of the MCD. Let the permutation 
titi-i ■ ■ -ti be denoted by cr,;. The cost of the MCD is 



M^ (a) 



^ip{ai,bi). 



By replacing each transposition ti = (uibi) in 
T by a corresponding pair of h-transpositions 

{(J^'-i {at) , {tti ^ bi)) {al}^ {bi) , {bi ^ tti)) , one can 
see that 



M(cr) 



^{ip{ai,bi) + V'(6i,a,)) 



since ip and ip are consistent. Hence, the h-decomposition 

{(^0^ (^i) ' (^1 -^ «i)) {'^0^ (oi) ' (ai -^ bi)) 

has cost M (cr). In other words, decomposing each transposi- 
tion in an MCD into h-transpositions establishes the claimed 
result. ■ 

Corollary 18. For a fixed tp and a cycle a, one has 

M (cr) > maxminC,/, (H) 

where the maximum is taken over all h-transposition costs 
Ip consistent with ip, and the minimum is taken over all h- 
decompositions H of a with cost C^ (H). D 

Lemma 19. It holds that M {a) > i ^^ cost (p* (i, a (J))). 

Proof: Define ?/;i/2 ^s 

V'i/2 (a, b) = -01/2 {b, a)^ip (a, b) /2. 

It is clear that 01/2 is consistent with (p. Hence, we have 



M(cr) > maxminC,/, (iJ) 



> nmi Cv,,/2 (H) 



i (ab)£p*{i,a{i)) 



where (*) follows from the fact that the minimum cost h- 
decomposition uses the shortest path p* {i,a{i)) between i 
and (j{i). In this case, i becomes the predecessor of cr (i) 
through the following sequence of h-transpositions: 

(«, {v„i ^ a (i))) • • • (i, (wi -> V2)) (i, (i -^ wi)) 

where p*{i,a{i)) = iviV2 ■ ■ ■ v^o' (i) is the shortest path 
between i and a (i). ■ 

Observe that Theorem [16] asserts that a minimum cost MLD 
never exceeds the cost of the corresponding MCD by more than 
a factor of four. Hence, a minimum cost MLD represents a 
good approximation for an MCD, independent of the choice 
of the cost function. On the other hand, STDs and their 
corresponding path search algorithms are attractive alternatives 
to MLDs and dynamic programs, due to the fact that they are 
particularly simple to implement. 

Example 20. Consider the cycle a — (12345) and the cost 
function ip, with ip (2,4) = ip{2,5) = (p{3,5) = 1, and 
f{hi) — 100 for all remaining transposition. First, observe 
that the costs are not reduced according to Alg. [T] Neverthe- 
less, one can use the upper-bound for the transposition cost in 
terms of the shortest paths defined in the proof of Lemma [19] 
In this case, one obtains 

M (cr) > - (100 + 2 + 3 + 2 + 100) = 103.5. 

For example, the second term in the sum corresponds to a path 
going from 2 to 5 and then from 5 to 3. The cost of this path 
is two. 

Since M (cr) has to be an integer, it follows that M (cr) > 
104. 

The optimized cost function, ipj*, obtained from Alg.[T]gives 



1, 


(zj)e{(25),(35),(24)} 


3, 


(zj) e {(23) , (45)} 


5, 


(U) = (34) 


100, 


otherwise 



V* ihj) 



A minimum cost MLD can be computed using the 
dynamic program of Alg. [2] One minimum cost MLD 
equals tl = (45) (35) (12) (25), and has cost L (cr) = 
105. By substituting each of the transposition in tl with 
their minimum cost transposition decomposition, we obtain 
(24) (25) (24) (35) (12) (25). 

It is easy to see that 

Ts = (12) (23) (34) (45) 
is the STD of a with cost 5 (cr) = 100 + 3 + 5 + 3 = 111. 
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Hence, the inequality M (a) < L{<7) < S (a) < 4M (a) 
holds. Furthermore, note that a is an even cycle, and hence 
must have an even number of transpositions in any of its 
decompositions. This shows that AI (cr) = L (a) ~ 105. D 

C. Metric-Path and Extended-Metric-Path Cost Functions 

We show next that for two non-trivial families of cost 
functions, one can improve upon the bounds of Theorem 
[T6l For metric-path cost functions, a minimum cost MLD is 
actually an MCD, i.e., L (a) — M (a). For extended-metric - 
path costs, it holds that L (a) < 2M (cr). 

Note that metric -path costs are not the only cost functions 
which admit MCDs of the form of MLDs - another example 
includes star transposition costs. For such costs, one has 
vihi) = c« for all i,j except for one index i. The remaining 
costs are arbitrary, but non-negative. The proof for this special 
case is straightforward and hence omitted. 

Lemma 21. For a cycle a and a metric-path cost function ip, 

i^{^)<\Y.^V (*, <y (0) = \ E^ cost {p* (l, a (*))). 

Proof: The equality in the lemma follows from the 
definition of metric -path cost functions. 

We recursively construct a spanning tree T (cr) of cost 
B{(^) = \Y.iV {h 0- («)), such that G (a) U T (a) is planar. 
Since T (cr) corresponds to an MLD, L (cr) < B (a). 

The validity of the recursive construction can be proved by 
induction. For k = 2, T (cr) is the edge (12). Assume next 
that the cost of T (cr) for any cycle of length < fc — 1 equals 
Bia). 

For a cycle of length k, without loss of generality, assume 
that the vertex labeled 1 is a leaf in 9s, the defining path of 
ip, and that t is its parent. We construct T (a) from smaller 
trees by letting 

T (cr) = (It) U T ((2 ■ • • i)) U T ((t • • • k)) . 

See Figure |7] for an illustration. The cost of T (cr) is equal 
to B {{2-- ■t))+B{(t- ■ ■k))+ip{l,t). Note that we can write 

5((2---i)) = iE^(*''^«) + ^^(2,t) 

i=2 
= 9E^(*''^W)+9^(2,i)--^(l,2), 



fe-1 

B{{t---k))^-Y,vih'j{^)) + ^v(t,k) 

1 .^ 1 1 

= :;^vih^ (*)) + l^V (t, k) - -if (1, k) . 



'<\V 


1 


•- 

3 


• — 

5 


— • — 

2 


4 


6 


'v\^ 


1 


3 




2 




4 














3 


5 




4 




6 


(a) e (<t) U T 


'^) 






(b)e. 









Since (^(1,2) = (^(l,i) + ip{t,2) and (pil,k) = ip{l,t) + 
(p {t, k), it follows that 

B ((2 • • . i)) + B ((t • • . fc)) = B{{l---k))^cp (1, t) . 

This completes the proof of the Lemma. ■ 



Figure 7: Example illustrating the proof of Lemma 1211 (a) The 
cycle a — (12345). Edges of Q (a) are shown with dashed 
arcs. The edge (13), shown with a thick solid line, belongs 
to T{a). The tree r((3456)) consists of solid edges on the 
left hand side of (13) and T ((23)) consists of solid edges on 
the right hand side of (13). As stated in the proof, we have 
T (cr) = (13) U T ((23)) U T ((3456)). (b) The defining path 
Qs of ip. Vertex 1 is a leaf and 3 is its parent. 



Theorem 22. For a cycle a and a metric-path cost function, 
one has 

L{a)^M{a)^\Y.^{i,a{i)). 



Proof: Since L (cr) > M [a), it suffices to show that 

L{<y) < kE^vih^i^)) and Mia) > ^E^^i^'J^ 
Lemma 1211 establishes that L (a) < i ^. (/? (i,cr (i)). From 
Lemma [T9l it also follows that 

Af(^)> incest (p*(z,a(z))). 



Since (/? is a metric-path cost function, we have (p (i, a (i)) = 
cost {p* {i, a («))). This proves the claimed result. I 

Theorem 23. For extended-metric-path cost functions ipe 
Proof: We prove the theorem by estabhshing that 

(a) ,5-^ (b) 

icpe (<^) < 2^ cost {p* ii,a{i))) < 2M^^ {a) 



where p* {i, a (i)) is the shortest path between i and a (i) in 
/C {(fie) and is calculated with respect to the cost function ipe- 
Let Qs be the defining path of an extended-metric -path cost 
(fe- Consider the metric-path cost function, ip„i, with defining 
path 0s, and with costs of all edges (if) G Os doubled. If the 
edge (ij) ^ Qg, and if ciC2 • • • c;+i is the unique path from 
ci = i to Q+i = j in 8s, then 

I I 

'Pni{i,j) = ^(pm{ct,Ct+l) = 2^(pe{ct,Ct+l). 
t=l t=l 

By ( [Tol l, iff. {i,j) < (fim ihj), for all i,j. Hence, L^^ (cr) < 
^Vm ("') ■ Now, following along the same lines of the proof of 
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1 
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(b) MCD 



(c) STD 



Figure 8: MLD (a) and MCD (b) for a = (12345). Edge labels 
denote the order in which transpositions are applied 



Lemma |2T1 it can be shown that 

i^e (<^) < L^^ (a) (19) 

i 

= ^ cost (p* (i, a (i))) , 

i 

which proves (a). 

Note that Lemma [T9l holds for all non-negative cost func- 
tions, including extended-metric -path cost functions. Thus, 

i 

which proves (b). ■ 

Example 24. Consider the cycle a — (12345) and the 
extended-metric -path cost function of Example [T] 

By inspection, one can see that an MCD of a is 

(14) (13) (35) (24) (14) (13), with cost M (a) = 6. A mini- 
mum cost MLD of a is (14) (23) (13) (45), with cost L (a) = 
8. The STD is (12) (23) (34) (45), with cost S (a) = 12. Thus, 
we observe that the inequality L (a) < S (u) < 2M (a) is 
satisfied. D 



V. Optimizing Permutations with Multiple Cycles 

Most of the results in the previous section generalize to 
permutations with multiple cycles without much difficulty. We 
present next the generalization of those results. 

Let TT be a permutation in S„, with cycle decomposition 
aia2 ■ ■ ■ (Je. A decomposition of tt with minimum number of 
transpositions is the product of MLDs of individual cycles (7^. 
Thus, the minimum cost MLD of tt equals 

e 

L(^) = ^L(a,). 

t=i 

The STD of tt is the product of the STDs of individual cycles 

The following theorem generalizes the results presented for 
single cycle permutations to permutations with multiple cycles. 

Theorem 25. Consider a permutation tt with cycle decompo- 
sition (71(72 •• • <^h <^nd cost function (p. The following claims 
hold. 



1) S{7T)<2j:^COSt{p*{l,7T{t))). 

2) M{7^)>^J2^cost{p*{^,^T{^))). 

3) L{tt)<S (tt) < 4M (tt) . 

4) If if is a metric-path cost function, then 

M (tt) = i (tt) . 

5) If if is an extended-metric-path cost function, then 

L (tt) < 2M (tt) . 

Proof: 
1) For each cycle it holds that 

S {at) <2Y, cost {p* [i, at {i))) , 

which can be seen by referring to (fTST l in the proof of 
Theorem [T6l Thus, 

e 
5(7r)=^5(a,) 
t=i 

(. 

<Y,2j2cost{p*ii,nm 

i — 1 iG(Tt 

n 

= 2^cost(p*(*,7r(z))). 



i=l 



2) The same argument as in Lemma [19] applies without 
modifications. 

3) For each I < t < £, from the proof of Theorem [16] we 
have L{at) < S{at)- Consequently, 

e i 

L(7r) = ^L(a,)<^5(a,) = 5(^). 
t=i t=i 

Furthermore, from parts 1 and 2 of this theorem, it 
follows that ^(Tr) < 4M (n). Therefore L (tt) < 
S{tt) < 4Af (tt). 

4) From Lemma [211 for each at, it holds that 

^(^t)<^^cost(p*(i,7r(z))). 

ieat 

By summing over all cycles, we obtain 

1 " 

L{TT)<-Y^COSt{p*il,TT{l))). 

4=1 

The claimed result follows from part 2 and the fact that 

A/(7r) < i(7r). 

5) From the proof of Theorem [23] we have L (at) < 
J2iecr cost (p* (i,Tr {{))). By summing over all cycles, 
we obtain 

n 

L{n) <Y^ cost {p* {i, TT {€))) < 2M (tt) , 
j=i 

where the last inequality follows from part 2 of this 
theorem. 
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A. Merging cycles 

In Section IIVI we demonstrated that the minimum cost of 
an MLD for an arbitrary permutation represents a constant 
approximation for an MCD. The MLD of a permutation 
represents the product of the MLDs of individual cycles 
of the permutation. Clearly, optimization of individual cycle 
costs may not lead to the minimum cost decomposition of 
a permutation. For example, it may happen that the cost of 
transpositions within a cycle are much higher than the costs 
of transpositions between elements in different cycles. It is 
therefore useful to analyze how merging of cycles may affect 
the overall cost of a decomposition. 

We propose a simple merging method that consists of two 
steps: 

1 ) Find a sequence of transpositions 



tk-l 



SO that a' = t'tt is a single cycle. Ideally, this sequence 
should have minimum cost, although this is not required 
in the proofs to follow. 
2) Find the minimum cost MLD r of cr'. 

The resulting decomposition is of the form t'^^t. 

Suppose that tt has k cycles. Joining k cycles requires fc — 1 
transpositions. Hence, each ti is a transposition joining two 
cycles of tt. The cost of r' equals X]j=i f i^i^ ^0' where ti — 
(aibi). The cost of the resulting decomposition using r' and 
the single cycle MLD equals 



show that 



C = ^ (y9 (fli, bi) + L{a'). 



(20) 



Since tt = ti ■ ■ ■ tk<T', we also have 

L{<j') < 4M (a') < 4 ( y (^ (a„ 6,) + M{tt) ] . (21) 



Hence, from (|20] | and dSTT i. C is upper bounded by 



fc-l /fe-l N 

C < y (p {a^, bi) + 4:i^ip (a„ 6,;) + M{tt) 
1=1 \i=i y 



(22) 



< 5kipmax+4:M{TT), 



where ipmax is the highest cost in ip. The approximation ratio, 
defined as C/M{tt), is upper bounded by 



a < 4 + 



5fc ip> 



n — k if. 



max . Oh:/Tl ^rnax /•'7a\ 



1 — k/n ip„ 



which follows from the fact M{tt) > (n — k) (pmin, where 
'P7nin is the smallest cost in (p, assumed to be nonzero. 

Although a is bounded by a value strictly larger than four, 
according to the expression above, this does not necessarily 
imply that merging cycles is sub-optimal compared to running 
the MLD algorithm on individual cycles. Furthermore, if the 
MCDs of single cycles can be computed correctly, one can 



fc-i 



C = y(^(a„6,) + M(a') 



fc-i 
<2 ^<f{a^,b,) + M{Tr) 
1=1 



(24) 



< 2kipmax + M (tt) 

The approximation ratio in this case is upper bounded by 



(25) 
(26) 



a < 1 



Z/C ^max 



1 



2k/n Lp„ 



n k (fniin J- k/Jl (prnin 

Lemma 26. Let -k be a randomly chosen permutation from 
§„. Given that the MCDs of single cycles can be computed 
correctly, and provided that (pmax = o (n/ log n), a goes to 
one in probability as n —^ oo. 

Proof: Let X„ be the random variable denoting the 
number of cycles in a random permutation 7r„ G §„. It 
is well known that EXn = J21=i ^ = ^ {n) and that 

^x„(x„-i) = {EXrf - VUr G21- ^^'"^' ■^(") 

denotes the nth Harmonic number. Thus 



(27) 



EX.^ = O (Inn 



which shows that Xn/n — ;> in quadratic mean as n — ^ oo. 
Hence Xn/n — > in probability. By Slutsky's theorem ifTSl . 
a — >■ 1 in probability as n ^ oo. ■ 

In the following example, all operations are performed 
modulo 10, with zero replaced by 10. 

Example 27. Consider the permutation tt = (Ji(T2, where 
a^ = (173 9 5) and (72 = (2 8 410 6), and the cost function 



'fiihj) 



dii,j) = l 
otherwise 



where d{i,j) = min{|i — j\ , 10 — |i — j\}. Note that 

cost {p* {i,j)) ^d{i,j). 

where p* is the shortest path from i to j. We make the 
following observations regarding the decompositions of tt. 
1) MCD: We cannot find the MCD of tt, but we can easily 
obtain the following bound: 



1 ^° 

M(7^)>[-ycost(p*(^,^(^)))] 

1=1 



2-5-4 



20. 



2) MLD: As before, let the output of Alg. [T] be denoted 
by ip*. We have ip* {i,j) — 2d{i,j) — 1. The minimum 
cost MLDs for the cycles are 

(17395) = (19) (3 7) (13) (9 5), 
(284106) = (2 10) (4 8) (2 4) (6 10), 

each of cost 20. The MLD of tt is the concatenation of 
the MLDs of cti and a2'. 

TT = (2 10) (4 8) (2 4) (6 10) (1 9) (3 7) (1 3) (9 5) , 

with overall cost equal to 40. 
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3) STD: It can be shown that the STD of cti is 

ai = (1 7) (7 3) (3 9) (9 5) , 
(72 - (2 8) (8 4) (4 10) (10 6) , 

each with cost 28. The total cost of the STD is S (tt) = 
56. 

4) Merging cycles: Instead of finding the minimum cost 
MLD of each cycle separately, we may join the cycles 
and find the MLD of a larger cycle. Here, we find the 
MLD of cr' = (1 2) TT = (1 7 3 9 5 2 8 4 10 6). The cost of 
the minimum MLD of a' can be shown to be 37. Since 
the cost of the transposition (12) must also be accounted 
for, the total cost is 38. Observe that this cost is smaller 
than the MLD cost of part 2, and hence merging cycles 
may provide better solutions than the ones indicated by 
the bound ( |23] ) or as obtained through optimization of 
individual cycles. D 

VI. Conclusions 

We introduced the problem of minimum cost transposition 
sorting and presented an algorithm for computing a transpo- 
sition decomposition of an arbitrary permutation, with cost 
at most four times the minimum cost. We also described an 
algorithm that finds the minimum cost of each transposition 
in terms of a product of other transpositions, as well as an 
algorithm that computes the minimum cost/minimum length 
decomposition using dynamic programing methods. 

We also showed that more accurate solutions are possible 
for two particular families of cost functions: for metric-path 
costs, we derived optimal decomposition algorithms, while for 
extended-metric-path costs, we described a 2-approximation 
method. 

The algorithms presented in this paper are of polynomial 
complexity. Finding the minimum cost of a transposition has 
complexity 0{n'^). Given the optimized cost transpositions, 
the minimum length decomposition can also be constructed 
in 0{n'^) steps. Computing a decomposition whose cost does 
not exceed the minimum cost by more than a factor of four 
requires O(n^) steps as well. 
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Appendix 

We describe an algorithm for finding (f{p{a,b)). The 
algorithm represents a variant of the Belbnan-Ford procedure, 
described in detail in lil5J . 

Recall that for each path p = vi ■ ■ ■ Vm+i in /C {(p), we 
define two types of costs: the standard cost of the path. 



cost (p) ^^(f (i;,, Vi+i) , 



i=l 



and the transposition path cost. 



ip{p) = 2 cost (p) - max tp (v^ , w^+i ) . 



(28) 



(29) 



The goal is to find the path that minimizes the transposition 
path cost in ( |29] |. 

Before describing our algorithm, we briefly review the 
standard Single-Source Bellman-Ford shortest path algorithm, 
and its relaxation techniques. 

Given a fixed source s, for each vertex v ^ s, the algorithm 
maintains an upper bound on the distance between s and v, 
denoted by D (v). Initially, for each vertex v, we have D {v) — 
(p{s,v). 

"Relaxing" an edge (uv) means testing that the upper- 
bounds D (u) and D (v) satisfy the conditions. 



D{u) < D{v)+w, 
D{v) < D{u) +w, 



(30) 



where w denotes the cost of the edge {uv). If the above 
conditions are not satisfied, then one of the two upper-bounds 
can be improved, since one can reach u by passing through 
V, and vice versa. 

In our algorithm, we maintain the upper-bound for two types 
of costs. The source s is an arbitrary vertex in IC{(p). For a 
path between s and a vertex v, we use Di (u) to denote the 
bound on the minimum transposition path cost, and we use 
D2 (v) to denote the bound on twice the minimum cost of the 
path. From the definitions of these costs, it is clear that 

D2{u)<2w + D2{v), (31) 

D2 {v) <2w + D2 (u) , 

Di (u) < min {w + D2 (v) ,2w + Di (v)} , 

Di (v) < min {w + D2 (u) ,2w + Di (u)} . 

The relaxation algorithm for these inequalities, Alg. [3] is 
straightforward to implement. To describe the properties of the 
output of the Bellman-Ford algorithm, we briefly comment on 
a simple property of the algorithm, termed the path-relaxation 
property. 

Suppose p — vi ■ ■ ■ Vjn+i is the shortest path (in terms of 
( |28] ) or ( |29] |) from s — vi Xo u — Vm+i- After relaxing the 
edges (t;ii;2) , {v2Vz) , ■ • • (wmWm+i), in that given order, the 
upper-bound Di (u) (for i — 1,2) equals the optimal cost of 
the corresponding path. Note that the property still holds even 
if the relaxations of the edges (wif2) , (W2W3) , • • • , {vmVm+i) 
are interleaved by relaxations of some other edges. In other 
words, it suffices to identify only a subsequence of relaxations 
of the edges (t;ii;2) , (W2W3) , • • • (vmVm+i)- 

In the algorithm below, we use predi{v) to denote the 
predecessor of node v used for tracking the updates of the cost 
Di{v), i — 1,2, and {u,i),i — 1,2, to indicate from which 
of the two costs, minimized over in (ISTT l. u originated. Note 
that this notion of predecessor is not to be confused with the 
predecessor element in a two-line permutation representation. 

The Bellman-Ford algorithm performs n — 1 rounds of 
relaxation on the edges of the graph IC{(p). Lemma |28] proves 
the correctness of the algorithm. 

An example of the steps of Alg. |4] is given in Figure |9] 
Initially, only edges between a, b, and c and a have finite costs, 
as a result of steps 4-8 of the algorithm. Next, n — 1 passes 
are executed and in each of them all edges of the graph are 
relaxed. Edge (bd) is relaxed first, as seen in Figure |9b] Next, 
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Algorithm 3 Relax(m, v) 


1 


W •<— (/3(m, v) 


2 
3 
4 


if D2(u) > £'2(u) + 2u; then 
£'2(«)^£'2(u) + 2k; 
pred2{v) i~ (u, 2) 


5 
6 

7 


if D2(u) > D2iv) + 2w then 
£i2(u) ^ D2{v) + 2w 
pred2(u) <r~ (v, 2) 


8 

9 

10 


if Di{v) > D2{u) + w then 

Dl{v) <r- D2{u) +W 

prediiv) <r- (u, 2) 


11 
12 
13 


if Di{u) > D2{v) + w then 

Di{u) <- L'2(w) +w 
pred{u, 1) <- (w, 2) 


14 
15 
16 


if i:»i(i') > Di{u) + 2w then 
i:'i(w) ^i:>i(u) + 2-u; 
prediiv) ■<— (m, 1) 


17 
18 
19 


if i:»i(u) > Di{v) + 2w then 
i:»i(w) ^ /?!(«) + 2w 
predi{u) •<— (w, 1) 




Algorithm 4 Single-Source Bellman-Ford(s) 


1 

2 
3 
4 
5 
6 
7 

8 

9 

10 


Input: vertex s 

Output: p{s, u) for 1 < u < n 

for u <— 1 • • • n do 

Diiu) <— (/3(s, u) 

predi{u) <— s 

L'2(m) <- 2(^(s,u) 

pred2{u) <r- s 

for i <— 1 • • • n — 1 do 

for each edge [uv) e E{K{ip)) do 
Relax(u,i;) 


11 
12 
13 

14 


for u ^ 1 •• ■ n do 
initialize path at u 

backtrack min cost alg to recover path to s 
output p[s, u) 



relaxes all edges of the graph. Thus, there exist a subsequence 
of relaxations that relax (W1V2) , {^2^-^) , • ■ • , [vmVm+i), in 
that order. The proof for the claim regarding Di (u) follows 
by invoking the path-relaxation property and the fact that 
(/?* (s,m) = (p{p{s,u)). The proof for the claim regarding 
D2 [u) is similar. ■ 
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the relaxation of edge [cd) reduces the cost of Di (d) from 12 
to 10. Continuing with the algorithm, we obtain the final result 
in Figure |9j Note that in this example, the result obtained 
after the first pass is the final result. In general, however, the 
final costs may be obtained only after all n — 1 passes are 
performed. 

Lemma 28. Given n, a cost function ip, and a source s, after 
the execution of Alg. (|4|i, one has Di(u) — (p* {s,u) and 
D2{u) = 2 cost (p*(s, It)). 

Proof: Let p{s,u) — viV2---v,n+i be the path that 
minimizes (f (p) among all paths p between vi ^ s and 
Vm+i = u. Since any path p has at most n vertices, we have 
TO < n— 1. The algorithm makes n — 1 passes and in each pass 
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Figure 9: Single-Pair Bellman-Ford on a 6-vertex graph. The costs {Di (u) ,D2 (u)), are shown inside each vertex. 
Edges that are not drawn have weight oo. 



